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Abstract 



We investigate the exploration problem of a short-sighted mobile 
robot moving in an unknown cellular room. To explore a cell, the robot 
must enter it. Once inside, the robot knows which of the 4 adjacent 
cells exist and which are boundary edges. The robot starts from a 
specified cell adjacent to the room's outer wall; it visits each cell, and 
returns to the start. Our interest is in a short exploration tour; that 
is, in keeping the number of multiple cell visits small. For abitrary 
environments containing no obstacles we provide a strategy producing 
tours of length 

5" < C + 

and for environments containing obstacles we provide a strategy, that 
is bound by 

S<C+^E + 3H + Wc^-2, 

where C denotes the number of cells — the area — , E denotes the 
number of boundary edges — the perimeter — , and H is the number of 
obstacles, and Wcw is a measure for the sinuosity of the given environ- 
ment. 

The strategics were implemented in a Java applet |13| that can be 
found in 

http : //www . geometrylab . de/ Gridrobot/ 

Key words: Robot navigation. Online algorithms, competitive anal- 
ysis, unknown environment, obstacles. 



1 Introduction 



Exploring an unknown environment and searching for a target in unknown 
position are among the basic tasks of autonomous mobile robots. Both 
problems have received a lot of attention in computational geometry and in 
robotics; see, for example, I2l[a[71[l0l[ni[l6l[171[l8l|2ll|22l[26]. 

In general, one assumes that the robot is equipped with an ideal vision 
system that provides, in a continuous way, the full visibility polygon of the 
robot's current position. In practice, there are at least two reasons that 
limit the reach of this model. 

• A realistic laser scanner can reliably detect objects only within a dis- 
tance of up to a few meters. Hence, in a large environment the robot 
has to move towards more distant areas to explore them. 

• Service robots like lawn mowers or cleaning devices need to get close 
to the parts of the environment they want to inspect and, possibly, to 
work on. 

To accomodate these situations, we study in this paper the model of a 
rather short-sighted robot. We assume that the environment is given by 
a simple polygon, P. Its edges are either vertical or horizontal, and its 
vertices have integer coordinates. Thus, the interior of P consists of integer 
grid cells. An interior cell is either free or blocked 

The robot starts from a free cell, s, adjacent to the polygon's boundary. 
From there it can enter one of the neighboring free cells, and so on. Once 
inside a cell, the robot knows which of its 4 neighbors are blocked and which 
are free. The robot's task is to visit each free cell inside P and to return to 
the start; see Figure [I](i) for an example. This example shows a tour that 
visits each cell once and some cells even more often. Our interest is in a 
short exploration tour, so we would like to keep the number of excess cell 
visits small. 

Even though our robot does not know its environment in advance, it is 
interesting to ask how short a tour can be found in the offline situation (i.e., 
when the environment is already known). This amounts to constructing a 
shortest traveling salesperson tour on the free cells. 

If the polygonal environment contains obstacles, the problem of finding 
such a minimum length tour is known to be NP-hard, by work of Itai et 
al. [19]. There are 1+e approximation schemes by Grigni et al. [12], Arora [3], 
and Mitchell [20], and a |g approximation by Arkin et al. [3]. 

In a simple polygon without obstacles, the complexity of constructing 
offline a minimum length tour seems to be open. Ntafos [23] and Arkin 

^For convenience, we consider the exterior of the polygon to consist of blocked cells 
only. 
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(i) 



(li) 



Figure 1: (i) An example exploration tour, (ii) A shortest TSP tour for the same 
polygon. 



et al. [3] have shown how to approximate the minimum length tour with 
factors of | and |, respectively. Umans and Lenhart [27] have provided an 
O(C^) algorithm for deciding if there exists a Hamiltonian cycle (i.e., a tour 
that visits each of the C cells of a polygon exactly once). For the related 
problem of Hamiltonian paths (i.e., different start and end positions), Ev- 
erett [S] has given a polynomial algorithm for certain grid graphs. Cho and 
Zelikovsky [15] studied spanning closed trails (a relaxation of Hamiltonian 
cyles) in grid graphsJl 

In this paper, our interest is in the online version of the cell exploration 
problem. The task of exploring an grid polygon with holes was indepen- 
dently considered by Gabriely and Rimon [11]. They introduce a somehow 
artificial robot model by distinguishing between the robot and its tool. The 
tool has the size of one grid cell and moves from cell to cell. The robot 
moves between the midpoints of 2 x 2-blocks of cells. The robot constructs 
a spanning tree on the 2x2 cell-blocks and the tools explores the polygon 
keeping the spanning tree edges on its right side. Only if the tool's path is 
blocked by an obstacle, it switches to the other side of the spanning tree and 
keeps the spanning tree edges on it's left side. This model allows a smart 
analysis yielding an upper bound of C + -B, where C denotes the number 
of cells and B the number of boundary cells. However, this bound is larger 
than C + + 3H + Wcw — 2 except for corridors of width one, in which 
both bounds are the same, this may justify our more complicated analysis 
of the strategy. 

Another online task is the piecemeal exploration, where the robot has to 
interrupt the exploration every now and then so as to return to the start 

^The grid graph corresponding to a grid polygon, P, consists of one node for every free 
cell in P. Two nodes are connected by an edge, if their corresponding cells are adjacent. 
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point, for example, to refuel. Piecemeal exploration of grid graphs was 
studied by Betke et al. [6] and Albers et al. [1]. Note that their objective is 
to visit every node and every edge, whereas we require a complete coverage 
of only the cells. Subdividing the robot's environment into grid cells is used 
also in the robotics community, see, for example, Moravec and Elfes [23], 
and Elfes [8]. 

Our paper is organized as follows: In Section [5] we give more detailled 
description of our robot and the environment. We analyze the competitive 
complexity in Section O showing lower bounds for both simple polygons and 
polygons with holes. In Section U] we present an exploration strategy for 
simple polygons together with the analysis of the strategy and in Section [5] 
we present and analyze a strategy for polygons with holes. 



2 Definitions 











A 
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(l) 



(ii) 



Figure 2: (i) Polygon with 23 cells, 38 edges and one(!) hole (black cells), (ii) the 
robot can determine which of the 4 adjacent cells are free, and enter an adjacent 
free cell. 



Definition 1 A cell is a basic block in our environment, defined by a 
tuple {x,y) G IN^. A cell is either free and can be visited by the robot, or 
blocked (i.e., unaccessible for the robot) H We call two cells ci = (a^i, yi), C2 = 
(3^2; 2/2) adjacent or neighboring, if they share a common edge (i.e., if \xi — 
3^2! + |yi — 2/2! = 1 holds), and touching, if they share a common edge or 
corner. 

A path, n, from a cell s to a cell i is a sequence of free cells s = 
c\,...,Cn = t where Cj and q+i are adjacent for i = l,...,n — 1. Let 
|n| denote the length of II. We assume that the cells have unit size, so the 
length of the path is equal to the number of steps from cell to cell that the 
robot walks. 

A grid polygon, P, is a connected set of free cells; that is, for every 
c\,C2 G P exists a path from ci to C2 that lies completely in P. 

We call a set of touching blocked cells that are completely surrounded 
by free cells an obstacle or hole, see Figure [2j Polygons without holes are 
called simple polygons. 
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C = 43 



E = 86 = 2C 



E = 'S4« 2C 



Figure 3: The perimeter, E, is used to distinguish between thin and thick environ- 
ments. 



We analyze the performance of an exploration strategy using some pa- 
rameters of the grid polygon. In addition to the area, C, of a polygon we 
introduce the perimeter, E. C is the number of free cells and E is the total 
number of edges that appear between a free cell and a blocked cell, see, 
for example. Figure [5] or Figure El We use E to distinguish between thin 
and thick environments, see Section [3l In Section 15.21 we introduce another 
parameter, the sinuosity VFcwi to distinguish between straight and twisted 
polygons. 

3 Competitive Complexity 

We are interested in an online exploration. In this setting, the environment 
is not known to the robot in advance. Thus, the first question is whether 
the robot is still able to approximate the optimum solution up to a constant 
factor in this setting. There is a quick and rather simple answer to this 
question: 

Theorem 2 The competitive complexity of exploring an unknown cellular 
environment with obstacles is equal to 2. 

Proof. Even if the environment is unknown we can apply a simple depth- 
first search algorithm (DFS) to the grid graph. This results in a complete 
exploration in 2C — 2 steps. The shortest tour needs at least C steps to visit 
all cells and to return to s, so DFS is competitive with a factor of 2. 

On the other hand, 2 is also a lower bound for the competitive factor of 
any strategy. To prove this, we construct a special grid polygon depending 
on the behavior of the strategy. The start position, s, is located in a long 
corridor of width 1. We fix a large number, Q, and observe how the strategy 
explores this corridor. Two cases occur. 

^In the following, we sometimes use the terms free cells and cells synonymously. 
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Case 1: The robot eventually returns to s after walking at least Q and 
at most 2Q steps. At this time, we close the corridor with two unvisited 
cells, one at each end, see Figure Hl^i). Let R be the number of cells visited 
so far. The robot has already walked at least 2R — 2 steps and needs another 
2R steps to visit the two remaining cells and to return to s, whereas the 
shortest tour needs only 2R steps to accomplish this task. 

N R 5H 

(i) I ■ : : : : : : : : 1 
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Figure 4: A lower bound of 2 for the exploration of grid polygons. 

Case 2: In the remaining case the robot concentrates — more or less — 
on one end of the corridor. Let R be the number of cells visited after 2Q 
steps. Now, we add a bifurcation at a cell b immediately behind the farthest 
visited cell in the corridor, see Figure Hl^ii) . Two paths arise, which turn 
back and run parallel to the long corridor. If the robots returns to s before 
exploring one of the two paths an argument analogous to case 1 applies. 
Otherwise, one of the two paths will eventually be explored up to the cell 
e where it turns out that this corridor is connected to the other end of the 
first corridor. At this time, the other path is defined to be a dead end of 
length R', which closes just one cell behind the last visited cell e'. 

From e the robot still has to walk to the other end of the corridor, to 
visit the dead end, and to return to s. Altogether, it will have walked at 
least four times the length of the corridor, R, plus four times the length 
of the dead end, R' . The optimal path needs only 2R + 2R' , apart from a 
constant number of steps for the vertical segments. 

In any case, the lower bound for the number of steps tends to 2 while Q 
goes to infinity. □ 

We cannot apply Theorem [2] to simple polygons, because we used a polygon 
with a hole to show the lower bound. The following lower bound holds for 
simple polygons. 

Theorem 3 Every strategy for the exploration of a simple grid polygon 
with C cells needs at least I C steps. 
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(iv) (v) (vi) (vii) 



Figure 5: A lower bound for the exploration of simple polygons. The dashed lines 
show the optimal solution. 

Proof. We assume that the robot starts in a corner of the polygon, see Fig- 
ure [5|^i) where A denotes the robot's position. Let us assume, the strategy 
decides to walk one step to the east — if the strategy walks to the south we 
use a mirrored construction. For the second step, the strategy has two pos- 
sibilities: Either it leaves the wall with a step to the south, see Figure [5|^ii) , 
or it continues to follow the wall with a further step to the east, see Fig- 
ure [5]^iii). In the first case, we close the polygon as shown in Figure [5](^iv) . 
The robot needs at least 8 steps to explore this polygon, but the optimal 
strategy needs only 6 steps yielding a factor of | 1.3. In the second case 
we proceed as follows. If the robot leaves the boundary, we close the polygon 
as shown in Figure [5jv) and (vi). The robot needs 12 step, but 10 steps are 
sufficient. In the most interesting case, the robot still follows the wall, see 
Figure [5^ vii) . In this case, the robot will need at least 28 steps to explore 
this polygon, whereas an optimal strategy needs only 24 steps. This leaves 
us with a factor of || = | « 1.16. 

We can easily extend this pattern to build polygons of arbitrary size by 
repeating the preceding construction several times using the entry and exit 
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cells denoted by the arrows in Figure [5][^iv)-(vii). As soon as the robot leaves 
one block, it enters the start cell of the next block and the game starts again; 
that is, we build the next block depending on the robot's behavior. Note 
that this construction cannot lead to overlapping polygons or polygons with 
holes, because the polygon always extends to the same direction. □ 
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Figure 6: DFS is not the best possible strategy. 



Even though we have seen in Theorem [5] that the simple DFS strategy 
already achieves the optimal competitive factor in polygons with holes, DFS 
is not the best possible exploration strategy! There is no reason to visit each 
cell twice just because this is required in some special situations like dead 
ends of width 1. Instead, a strategy should make use of wider areas, see 
Figure [6l 

We use the perimeter, E, to distinguish between thin environments that 
have many corridors of width 1, and thick environments that have wider 
areas, see Figure [3] on page HI In the following sections we present strategies 
that explore grid polygons using no more than roughly C + steps. Since 
all cells in the environment have to be visited, C is a lower bound on the 
number of steps that are needed to explore the whole polygon and to return 
to 50 Thus, ^ \E is an upper bound for the number of additional cell 
visits. For thick environments, the value of E is in 0{^fC)^ so that the 
number of additional cell visits is substantially smaller than the number of 
free cells. Only for polygons that do not contain any 2x2 square of free cells, 
E achieves its maximum value of 2(C + 1), and our upper bound is equal 
to 2C — 2, which is the cost of applying DFS. But in this case one cannot 
do better, because even the optimal offline strategy needs that number of 
steps. In other cases, our strategies are more efficient than DFS. 



^More precisely, we need at least C — 1 steps to visit every cell, and at least 1 step to 
return to s. 
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4 Exploring Simple Polygons 



We have seen in the previous section that a simple DFS traversal achieves 
a competitive factor of 2. Because the lower bound for simple grid poly- 
gons is substantially smaller, there may be a strategy that yields a better 
factor. Indeed, we can improve the DFS strategy. In this section, we give 
a precise description of DFS and present two improvements that lead to a 
|-competitive exploration strategy for simple polygons. 

4.1 An Exploration Strategy 

There are four possible directions — north, south, east and west — for the 
robot to move from one cell to an adjacent cell. We use the command 
move (dir) to execute the actual motion of the robot. The function un- 
explored (dii) returns true, if the cell in the given direction seen from the 
robot's current position is not yet visited, and false otherwise. For a given 
direction dir, cw(diT) denotes the direction turned 90° clockwise, ccw(diT) 
the direction turned 90° counterclockwise, and reverse(dii) the direction 
turned by 180°. 

Using these basic commands, the simple DFS strategy can be imple- 
mented as shown in Algorithm 14. li For every cell that is entered in direction 
dir, the robot tries to visit the adjacent cells in clockwise order, see the pro- 
cedure ExploreCell. If the adjacent cell is still unexplored, the robot enters 
this cell, recursively calls ExploreCell, and walks back, see the procedure Ex- 
ploreStep. Altogether, the polygon is explored following the left-hand rule: 
The robot proceeds from one unexplored cell to the next while the polygon's 
boundary or the explored cells are always to its left hand side. 

Obviously, all cells are visited, because the graph is connected, and the 
whole path consists of 2C — 2 steps, because each cell — except for the start — 
is entered exactly once by the first move statement, and left exactly once by 
the second move statement in the procedure ExploreStep. 




Figure 7: First improvement to DFS: Return directly to those cells that still have 
unexplored neighbors. 

The first improvement to the simple DFS is to return directly to those 
cells that have unexplored neighbors. See, for example. Figure [T) After 
the robot has reached the cell ci, DFS walks to C2 through the completely 
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Algorithm 4.1 DFS 



DFS(P, start): 

Choose dhection dir, so that reverse(c?zr) points to a blocked cell; 
ExploreCell( dir) ; 

ExploreCell( rfir ) : 

// Left-Hand Rule: 
ExploreStep (ccw ( dir) ) ; 
ExploreStep(dir); 
ExploreStep(cw( dir)) ; 

ExploreStep(dzr): 

if unexplored (dir) then 

move(dir); 

ExploreCell(dir); 

move(reverse( dir) ) ; 
end if 



explored corridor of width 2. A more efficient return path walks on a shortest 
path from ci to C2- Note that the robot can use for this shortest path only 
cells that are already known. With this modification, the robot's position 
might change between two calls of ExploreStep. Therefore, the procedure 
ExploreCell has to store the current position, and the robot has to walk on 
the shortest path to this cell, see the procedure ExploreStep in Algorithm [121 
The function unexplored (cell, dii) returns true, if the cell in direction dir 
from cell is not yet visited. 
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Figure 8: Second improvement to DFS; Detect polygon splits. 

Now, observe the polygon shown in Figure [8l DFS completely surrounds 
the polygon, returns to C2 and explores the left part of the polygon. After 
this, it walks to ci and explores the right part. Altogether, the robot walks 
four times through the narrow corridor. A more clever solution would ex- 
plore the right part immediately after the first visit of ci , and continue with 
the left part after this. This solution would walk only two times through the 
corridor in the middle! The cell ci has the property that the graph of unvis- 
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ited cells splits into two components after ci is explored. We call cells like 
this split cells. The second improvement to DFS is to recognize split cells 
and diverge from the left-hand rule when a split cell is detected. Essentially, 
we want to split the set of cells into several components, which are finished 
in the reversed order of their distances to the start cell. The detection and 
handling of split cells is specified in Section [321 Algorithm 14.21 resumes both 
improvements to DFS. 
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(i) (ii) 

Figure 9: Straightforward strategies are not better than SmartDFS. 

Note that the straightforward strategy Visit all boundary cells and cal- 
culate the optimal offline path for the rest of the polygon does not achieve a 
competitive factor better than 2. For example, in Figure [U|^i) this strategy 
visits almost every boundary cell twice, whereas SmartDFS visits only one 
cell twice. Even if we extend the simple strategy to detect split cells while 
visiting the boundary cells, we can not achieve a factor better than |. A 
lower bound on the performace of this strategy is a corridor of width 3, 
see Figure [9|^ii). Moreover, it is not known whether the offline strategy is 
NP-hard for simple polygons. 

4.2 The Analysis of SmartDFS 

SmartDFS explores the polygon in layers: Beginning with the cells along the 
boundary, SmartDFS proceeds towards the interior of P. Let us number the 
single layers: 

Definition 4 Let P be a (simple) grid polygon. The boundary cells of P 
uniquely define the first layer of P. The polygon P without its first layer 
is called the 1-offset of P. The £th layer and the ^-offset of P are defined 
successively, see Figure [lOl 

Note that the ^-offset of a polygon P is not necessarily connected. Al- 
though the preceding definition is independent from any strategy, SmartDFS 
can determine a cell's layer when the cell is visited for the first time. We can 
define the ^-offset in the same way for a polygon with holes, but the layer of 
a given cell can no longer be determined on the first visit in this case. The 
^-offset has an important property: 

Lemma 5 The i-offset of a simple grid polygon, P, has at least 8£ edges 
fewer than P. 
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Algorithm 4.2 SmartDFS 

SmartDFS(P, start): 

Choose direction dir for the robot, so that reverse( dir) points to 

a blocked ceh; 
ExploreCen(c?ir); 

Walk on the shortest path to the start cell; 

ExploreCell( dir ) : 

Mark the current cell with the number of the current layer; 
base := current position; 
if not isSplitCell( 6ase) then 

// Left-Hand Rule: 

ExploreStep (6ase, ccw{dir)); 

ExploreStep (6ase, dir); 

ExploreStep( 6ase, cw (dir)); 
else 

/ / choose different order, see page [Hl ff 

Determine the types of the components using the layer numbers 

of the surrounding cells; 
if No component of type III exists then 

Use the left-hand rule, but omit the first possible step, 
else 

Visit the component of type III at last, 
end if 
end if 

ExploreStep(6ase, dir): 

if unexplored (&ase, dir) then 

Walk on shortest path using known cells to base; 

move(dir); 

ExploreCell(dir); 
end if 
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Figure 10: The 2-ofFset (shaded) of a grid polygon P. 

Proof. First, we can cut off blind alleys that are narrower than 2£, because 
those parts of P do not affect the ^-offset. We walk clockwise around the 
boundary cells of the remaining polygon, see Figure [TOj For every left turn 
the offset gains at most 2i edges and for every right turn the offset looses at 
least 2i edges. O'Rourke [25] showed that ^^vertices = 2 -^reflex vertices + 4 
holds for orthogonal polygons, so there are four more right turns than left 
turns. □ 

Definition m allows us to specify the detection and handling of a split cell 
in SmartDFS. We start with the handling of a split cell and defer split cell 
detection. 

Let us consider the situation shown in Figure [TTIfi) to explain the han- 
dling of a split cell. SmartDFS has just met the first split cell, c, in the 
fourth layer of P. P divides into three parts: 

P = i^i U ^2 U { visited cells of P }, 

where Ki and K2 denote the connected components of the set of unvisited 
cells. In this case it is reasonable to explore the component K2 first, because 
the start cell s is closer to Ki; that is, we can extend Ki with i layers, such 
that the resulting polygon contains the start cell s. 

More generally, we want to divide our polygon P into two parts, Pi 
and P2, so that each of them is an extension of the two components. Both 
polygons overlap in the area around the split cell c. At least one of these 
polygons contains the start cell. If only one of the polygons contains s, we 
want our strategy to explore this part at last, expecting that in this part 
the path from the last visited cell back to s is the shorter than in the other 
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Figure 11: A decomposition of P at the split cell c and its handling in SmartDFS. 



part. Vice versa, if there is a polygon that does not contain s, we explore the 
corresponding component first. In Figure [TTl SmartDFS recursively enters 
K2, returns to the split cell c, and explores the component Ki next. 

In the preceding example, there is only one split cell in P, but in general 
there will be a sequence of split cells, ci, . . . , c^. In this case, we apply the 
handling of split cells in a recursive way; that is, if a split cell Cj+i, 1 < i < k, 
is detected in one of the two components occurring at q we proceed the same 
way as described earlier. Only the role of the start cell is now played by 
the preceding split cell Cj. In the following, the term start cell always refers 
to the start cell of the current component; that is, either to s or to the 
previously detected split cell. Further, it may occur that three components 
arise at a split cell, see Figure fT^ i) on page [151 We handle this case as two 
successive polygon splits occurring at the same split cell. 
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Figure 12: Several types of coniponents. 



Visiting Order 

We use the layer numbers to decide which component we have to visit at 
last. Whenever a split cell occurs in layer i, every component is one of the 
following types, see Figure [T2t 

I. Ki is completely surrounded by layer ^ 
II. Ki is not surrounded by layer £ 
III. Ki is partially surrounded by layer i 



first cell in layer 2 ^ ' 
first cell in layer 3 



Layer 3 
Layer 2 
Layer 1 



Figure 13: Switching the current layer. 



There are two cases, in which SmartDFS switches from a layer ^ — 1 to 
layer i. Either it reaches the first cell of layer £ — 1 in the current component 
and thus passes the start cell — see, for example, the switch from layer 1 to 
layer 2 in Figure [13] — , or it hits another cell of layer i — 1 but no polygon 
split occurs, such as the switch from layer 2 to layer 3 in in Figure [T3l In the 
second case, the considered start cell must be located in a narrow passage 
that is completely explored; otherwise, the strategy would be able to reach 



^More precisely, the part of layer £ that surrounds Ki is completely visited. For con- 
venience, we use the slightly sloppy, but shorter form. 
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the first cell of layer ^ — 1 as in the first case. In both cases the part of P 
surrounding a component of type III contains the first cell of the current 
layer i as well as the start cell. Therefore, it is reasonable to explore the 
component of type III at last. 

There are two cases, in which no component of type III exists when a 
split cell is detected: 

1. The part of the polygon that contains the preceding start cell is ex- 
plored completely, see for example Figure [T^i). In this case the order 
of the components makes no differencelfl 

2. Both components are completely surrounded by a layer, because the 
polygon split and the switch from one layer to the next occurs within 
the same cell, see Figure [Hlfii). A step that follows the left-hand 
rule will move towards the start cell, so we just omit this step. More 
precisely, if the the robot can walk to the left, we prefer a step forward 
to a step to the right. If the robot cannot walk to the left but straight 
forward, we proceed with a step to the right. 

We proceed with the rule in case 2 whenever there is no component of 
type III, because the order in case 1 does not make a difference. 



Layer 2 
Layer 1 
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(i) (ii) 
Figure 14: No component of type III exists. 



®In Figure 114^ we gain two steps, if we explore the part left to the splitcell at last 
and do not return to the split cell after this part is completely explored, but return 
immediately to the start cell. But decisions like this require facts of much more global 
type than we consider up to now. However, for the analysis of our strategy and the upper 
bound shortcuts like this do not matter. 
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An Upper Bound on the Number of Steps 

For the analysis of our strategy we consider two polygons, Pi and P2, as 
follows. Let Q be the square of width 2g + 1 around c with 

J i, if K2 is of type I 

\iK2 is of type II ' 

where K2 denotes the component that is explored first, and £ denotes the 
layer in which the split cell was found. We choose P2 C P U Q such that 
K2U{c} is the g-offset of P2, and Pi := ((P\P2)UQ)nP, see FigureHH The 
intersection with P is necessary, because Q may exceed the boundary of P. 
Note that at least Pi contains the preceding start cell. There is an arbitrary 
number of polygons P2, such that K2 U {c} is the q-offset of P2, because 
blind alleys of P2 that are not wider than 2g do not affect the g-offset. To 
ensure a unique choice of Pi and P2, we require that both Pi and P2 are 
connected, and both P U Q = Pi U P2 and Pi n P2 C Q are satisfied. 

The choice of Pi,P2 and Q ensures that the robot's path in Pi\Q and 
in P2\Q do not change compared to the path in P. The parts of the robot's 
path that lead from Pi to P2 and from P2 to Pi are fully contained in the 
square Q. Just the parts inside Q are bended to connect the appropriate 
paths inside Pi and P2, see Figure [TT] and Figure [T5l 




(i) (ii) 



Figure 15: The component K2 is of type I. The square Q may exceed P. 
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In Figure [TTl Ki is of type III and K2 is of type II. A component of type 
I occurs, if we detect a split cell as shown in Figure [T5j Note that Q may 
exceed P, but Pi and P2 are still well-defined. 



Q 



Layer 2 
Layer 1 



Figure 16: The order of components is not necessarily optimal. 

Remark that we do not guarantee that the path from the last visited cell 
back to the corresponding start cell is the shortest possible path. See, for 
example, Figure [T6l A split cell is met in layer 2. Following the preceding 
rule, SmartDFS enters K2 first, returns to c, explores Ki, and returns to 
s. A path that visits Ki first and moves from the upper cell in K2 to s is 
slightly shorter. A case like this may occur if the first cell of the current 
layer lies in Q. However, we guarantee that there is only one return path 
in Pi\Q and in P2\Q', that is, only one path leads from the last visited cell 
back to the preceding start cell causing double visits of cells. 

We want to visit every cell in the polygon and to return to s. Every 
strategy needs at least C{P) steps to fulfill this task, where C{P) denotes the 
number of cells in P. Thus, we can split the overall length of the exploration 
path, n, into two parts, C{P) and excess(P), with |n| = C{P) +excess(P). 
C{P) is a lower bound on the number of steps that are needed for the 
exploration task, whereas excess (P) is the number of additional cell visits. 

Because SmartDFS recursively explores K2 U {c}, we want to apply the 
upper bound inductively to the component K2 U {c}. If we explore Pi with 
SmartDFS until c is met, the set of unvisited cells of Pi is equal to Ki, 
because the path outside Q do not change. Thus, we can apply our bound 
inductively to Pi, too. The following lemma gives us the relation between 
the path lengths in P and the path lengths in the two components. 

Lemma 6 Let P he a simple grid polygon. Let the robot visit the first split 
cell, c, which splits the unvisited cells of P into two components Ki and K2, 
where K2 is of type I or IL With the preceding notations we have 

excess (P) < excess (Pi) + excess (-K'2 U {c}) + 1 . 

Proof. The strategy SmartDFS has reached the split cell c and explores 
K2 U {c} with start cell c first. Because c is the first split cell, there is 
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no excess in P2\{K2 U {c}) and it suffices to consider excess(Er2 U {c}) for 
this part of the polygon. After K2 U {c} is finished, the robot returns to 
c and explores Ki. For this part we take excess(Pi) into account. Finally, 
we add one single step, because the split cell c is visited twice: once, when 
SmartDFS detects the split and once more after the exploration of K2 U {c} 
is finished. Altogether, the given bound is achieved. □ 

c is the first split cell in P, so K2 U {c} is the g-offset of P2 and we can 
apply Lemma to bound the number of boundary edges of K2 U {c} by the 
number of boundary edges of P2- The following lemma allows us to charge 
the number of edges in Pi and P2 against the number of edges in P and Q. 

Lemma 7 Let P be a simple grid polygon, and let Pi, P2 and Q be defined 
as earlier. The number of edges satisfy the equation 

E{Pi) + E{P2) = E{P) + E{Q) . 

Proof. Obviously, two arbitrary polygons Pi and P2 always satisfy 

E{Pi) + E{P2) = E{Pi U P2) + S(Pi n P2) . 

Let Q' := Pi n P2. Note that Q' is not necessarily the same as Q, see, 
for example. Figure [15j With Pi U P2 = P U Q we have 

E{Pi) + E{P2) = ^(Pi n P2) + ^(Pi U P2) 
= E{Q') + E{PVJQ) 
= E{Q') + E{P)+E{Q)-E{PnQ) 
= E{P) + E{Q) 

The latter equation holds because Q' = P CiQ. □ 

Finally, we need an upper bound for the length of a path inside a grid 
polygon. 

Lemma 8 Let H be the shortest path between two cells in a grid polygon 
P. The length ofU is bounded by 

\U\ < ^E{P) - 2 . 

Proof. W.l.o.g. we can assume that the start cell, s, and the target cell, t, 
of n belong to the first layer of P, because we are searching for an upper 
bound for the shortest path between two arbitrary cells. 

Observe the path Hl from s to t in the first layer that follows the bound- 
ary of P clockwise and the path that follows the boundary counterclock- 
wise. The number of edges along these paths is at least four greater than 
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the number of cells visited by 11^ and 11^ using an argument similar to the 
proof of Lemma [5j Therefore we have: 

inLi + in^i <^(p)-4. 

In the worst case, both paths have the same length, so |n(s, t)\ = = 
\IIr\ holds. With this we have 

2-\U{s,t)\<E{P)-A =^ \U{s,t)\ < ^E{P)-2. 

□ 

Now, we are able to show our main theorem: 

Theorem 9 Let P be a simple grid polygon with C cells and E edges. P 
can he explored with 

steps. This hound is tight. 

Proof. C is the number of cells and thus a lower bound on the number of 
steps that are needed to explore the polygon P. We show by an induction 
on the number of components that excess(P) < — 3 holds. 

For the induction base we consider a polygon without any split cell: 
SmartDFS visits each cell and returns on the shortest path to the start cell. 
Because there is no polygon split, all cells of P can be visited by a path of 
length C — 1. By Lemma [8] the shortest path back to the start cell is not 
longer than ^E — 2; thus, excess(P) < ^E{P) — 3 holds. 

Now, we assume that there is more than one component during the 
application of SmartDFS. Let c be the first split cell detected in P. When 
SmartDFS reaches c, two new components, Ki and K2, occur. We consider 
the two polygons Pi and P2 defined as earlier, using the square Q around c. 

W.l.o.g. we assume that K2 is recursively explored first with c as start 
cell. After K2 is completely explored, SmartDFS proceeds with the remain- 
ing polygon. As shown in Lemma Owe have 

excess(P) < excess(Pi) + excess(iir2 U {c}) + 1 . 

Now, we apply the induction hypothesis to Pi and K2 U {c} and get 

excess(P) < \e{Pi) - 3 + ^E{K2 U {c}) - 3 + 1 . 

By applying Lemma [5] to the g-offset K2 U {c} of P2 we achieve 

excess(P) < ^E(Pi) - 3 + ^(^(Ps) - 8g) - 3 + 1 

= ^(E{Pi) + E{P2)) - 4q - 5 . 
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From Lemma [3 we conclude E{Pi) + E{P2) < E{P) + 4(2g + 1). Thus, we 
get excess(P) < \E{P) - 3. 

In Section [3] we have already seen that the bound is exactly achieved in 
polygons that do not contain any 2 x 2-square of free cells. □ 

Competitive Factor 

So far we have shown an upper bound on the number of steps needed to 
explore a polygon that depends on the number of cells and edges in the 
polygon. Now, we want to analyze SmartDFS in the competitive framework. 

Corridors of width 1 or 2 play a crucial role in the following, so we refer 
to them as narrow passages. More precisely, a cell, c, belongs to a narrow 
passage, if c can be removed without changing the layer number of any other 
ceU. 

It is easy to see that narrow passages are explored optimally: In corridors 
of width 1 both SmartDFS and the optimal strategy visit every cell twice, 
and in the other case both strategies visit every cell exactly once. 

We need two lemmata to show a competitive factor for SmartDFS. The 
first one gives us a relation between the number of cells and the number of 
edges for a special class of polygons. 

Lemma 10 For a simple grid polygon, P, with C{P) cells and E{P) edges, 
and without any narrow passage or split cells in the first layer, we have 

EiP)<^C{P) + 6. 

Proof. Consider a simple polygon, P. We successively remove a row or 
column of at least three boundary cells, maintaining our assumption that 
the polygon has no narrow passages or split cells in the first layer. These 
assumptions ensure that we can always find such a row or column (i.e., if we 
cannot find such a row or column, the polygon has a narrow passage or a 
split cell in the first layer). Thus, we remove at least three cells and at most 
two edges. This decomposition ends with a 3 x 3 block of cells that fulfills 
E = + 6. Now, we reverse our decomposition; that is, we successively 
add all rows and columns until we end up with P. In every step, we add 
at least three cells and at most two edges. Thus, £' < |C + 6 is fulfilled in 
every step. □ 

For the same class of polygons, we can show that SmartDFS behaves 
slightly better than the bound in Theorem [9l 

Lemma 11 A simple grid polygon, P, with C{P) cells and E[P) edges, 
and without any narrow passage or split cells in the first layer can he explored 
using no more steps than 

S{P)<C{P) + \E{P)-b. 
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Figure 17: For polygons without narrow passages or split cells in the first layer, the 
last explored cell, c', lies in the 1-offset, P' (shaded). 

Proof. In Theorem [9] we have seen that S{P) < C(P) + IE{P) - 3 holds. 
To show this theorem, we used Lemma [8] on page 1181 as an upper bound for 
the shortest path back from the last explored cell to the start cell. Lemma [8] 
bounds the shortest path from a cell, c, in the first layer of P to the cell 
c' that maximizes the distance to c inside P; thus, c' is located in the first 
layer of P, too. 

Because P has neither narrow passages nor split cells in the first layer, 
we can explore the first layer of P completely before we visit another layer, 
see Figure [T71 Therefore, the last explored cell, c', of P is located in the 
1-offset of P. Let P' denote the 1-offset of P, and s' the first visited cell in 
P' . Remark that s and s' are at least touching each other, so the length of 
a shortest path from s' to s is at most 2. Now, the shortest path, H, from 
c' to s in P is bounded by a shortest path, XT', from c' to s' in P' and a 
shortest path from s' to s: 

|n| < |n'| + 2. 

The path H', in turn, is bounded using Lemma [5] by 

|n'| < ^E{p') - 2. 

By Lemma [5] (page [TO]) . E{P') < E{P) — 8 holds, and altogether we get 

n<\E{p)-^, 

which is two steps shorter than stated in Lemma [HI □ 
Now, we can prove the following 

Theorem 12 The strategy SmartDFS is ^-competitive. 

Proof. Let P be a simple grid polygon. In the first stage, we remove 
all narrow passages from P and get a sequence of (sub-) polygons Pi, i = 
1, . . . ,k, without narrow passages. For every Pi, i = 1, . . . , k — 1, the optimal 
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strategy in P explores the part of P that corresponds to Pi up to the narrow 
passage that connects Pi with Pj+i, enters Pi+i, and fully explores every Pj 
with j > i. Then it returns to Pi and continues with the exploration of Pi. 
Further, we already know that narrow passages are explored optimally. This 
allows us to consider every Pi separately without changing the competitive 
factor of P. 

Now, we observe a (sub-)polygon Pi. We show by induction on the 
number of split cells in the first layer that S{Pi) < |C(Pj) — 2 holds. Note 
that this is exactly achieved in polygons of size 3xm, m even, see Figure \TE[ 




SmartDFS 



optimal strategy 



Figure 18: In a corridor of width 3 and even length, S{P) = | 5'opt(^') — 2 holds. 

If Pi has no split cell in the first layer (induction base), we can apply 
Lemma [11] and Lemma [TOl 



S{Pi) < C{Pi) + -E{Pi)-b 

- C{P^) + \^^C{Pi) + Q\-b 
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Figure 19: Three cases of split cells, (i) component of type II, (ii) and (iii) compo- 
nent of type I. 

Two cases occur if we meet a split cell, c, in the first layer, see Figure [T9l 
In the first case, the new component was never visited before (component 
of type II, see page [2]). Here, we define Q := {c}. The second case occurs, 
because the robot meets a cell, c', that is in the first layer and touches the 
current cell, c, see for example Figure fTWii) and (iii). Let Q be the smallest 
rectangle that contains both c and c'. 
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Similar to the proof of Theorem O we split the polygon Pi into two parts, 
both including Q. Let P" denote the part that includes the component of 
type I or II, P' the other part. For \Q\ = 1, see Figure fTW i). we conclude 
S{Pi) = S{P') + S{P") and C{Pi) = C{P') + C{P") - 1. Applying the 
induction hypothesis to P' and P" yields 

S{Pi) = S{P') + S{P") 

< ^C{P')-2 + ^C{P")-2 

= lc(P,) + ^-4 < lc{Pi)-2. 

For \Q\ £ {2,4} we gain some steps by merging the polygons. If we 
consider P' and P" separately, we count the steps from c' to c — or vice 
versa — in both polygons, but in Pi the path from c' to c is replaced by the 
exploration path in P". Thus, we have S{Pi) = S{P') + S{P") - \Q\ and 
C{Pi) = C{P') + C{P") - \Q\. This yields 

S{P,) = S{P') + S{P")-\Q\ 

< ^C{P')-2 + ^C{P")-2-\Q\ 

= lc(P,) + ^(|g|-6)-2 < ^C{P,)-2. 

The optimal strategy needs at least C steps, which, altogether, yields a 
competitive factor of 1. □ 



23 



Split Cell Detection 
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Figure 20: (i) Detecting a split cell, (ii) and (iii) a polygon split occurs in layer 1. 



Finally, we describe the detection of a split cell. In the first instance, 
let us assume that the robot already moves in a layer i > 1. In Section [4.11 
we defined that a split cell divides the graph of unexplored cells into two 
parts when the split cell is visited. Because the polygon is simple, we can 
determine a global split using a local criterion. We observe the eight cells 
surrounding the current robot's position. If there is more than one connected 
set of visited cells in this block, the current robot position is obviously a split 
cell, see Figure [20lfi). Remark that we can execute this test although the 
robot's sensors do not allow us to access all eight cells around the current 
position. We are interested only in visited cells, so we can use the robot's 
map of visited cells for our test in layers i > 1. Unfortunately, this test 
method fails in layer 1, because the robot does not know the "layer 0", the 
polygon walls. However, we want to visit the component that has no visited 
cell in the current layer (type II) first; therefore, a step that follows the 
left-hand rule is correct. The strategy behaves correctly, although we do 
not report the splitcell explicitly. See, for example, Figure [20^ 11) and (iii): 
In both cases the polygon split cannot be detected in c, because the cell 
marked with '?' is not known at this time. The split will be identified and 
handled correctly in c'. 
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5 Exploring Polygons with Holes 



In an environment with obstacles (holes) it is not obvious how to detect 
and to handle split cells. When a polygon split is detected, the robot may 
be far away from the split cell, because it had no chance to recognize the 
split before reaching its current position. For example, in Figure \2V( i) the 
robot has surrounded one single obstacle and c is a split cell, whereas in 
(ii) there are two obstacles and c is no split cell. Both situations cannot be 
distinguished until the cell c' is reached. So we use a different strategy to 
explore environments with obstacles. 




(i) (n) 

Figure 21: In an environment with obstacles, the robot may detect a split on a 
position far away from the splitcell, (i) c was a split cell, (ii) c was no split cell. 



5.1 An Exploration Strategy 

The basic idea of our strategy, CellExplore, is to reserve all cells right to the 
covered path for the way back. As in SmartDFS we use the left-hand rule; 
that is, the robot proceeds keeping the polygon's boundary or the reserved 
cells on its left side. CellExplore uses two modes. In the forward mode the 
robot enters undiscovered parts of the polygon, and in the backward mode 
the robot leaves known parts, see Algorithm 15.11 on page \Tl\ We require 
that the robot starts with its back to a wall. 



25 



1 23456789 10 



Figure 22: Example of an exploration tour produced by CellExplore (Screenshot 
using [13]; the white eells are holes, dark gray eells are reserved). 

Figure [22] shows an example of an exploration tour. The robot starts in 
Jl and explores the polygon in the forward mode until F8 is reached. There, 
the robot switches to the backward mode and explores the reserved cells FT- 
PS. The path from F5 to H5 is blocked by the hole in G5, so the robot walks 
on the cells F4-H4 which have been visited already in the forward mode. In 
H5 the robot discovers the unreserved and unexplored cell H6, switches back 
to the forward mode and explores the cells H6-H8. Note that no cells can 
be reserved in this case, because the cells 16-18 have already been reserved 
during the exploration of J8-J6. Therefore, the robot walks the same path 
back to H5 and continues the return path in the backward mode. In the 
forward mode, the robot could not reserve a cell from H3, so we move via 
H4 to 14 and proceed with the return path in the backward mode. The cells 
D9, C2 and G2 are blocked, so the robot has to circumvent these cells using 
visited cells. In C5 another unreserved and unexplored cell is discovered, so 
we switch to the forward mode and visit C4. 
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Algorithm 5.1 CellExplore 



Forward mode: 

• The polygon is explored following the left-hand rule: For every 
entered cell the robot tries to extend its path to an adjacent, 
unexplored, and unreserved cell, preferring a step to the lefl|j 
over a straight step over a step to the right. 

• All unexplored and unreserved cells right to the covered path are 
reserved for the return path by pushing them onto a stacklf] If no 
cell right to the robot's current position can be reserved — because 
there is a hole or the corresponding cell is already reserved or 
explored — the robot's position is pushed onto the stack for the 
return path. 

• Whenever no step in the forward mode is possible, the strategy 
enters the backward mode. 

Backward mode: 

• The robot walks back along the reserved return path. 

• Whenever an unexplored and unreserved cell appears adjacent to 
the current position, the forward mode is entered again. 



A straightforward improvement to the strategy CellExplore is to use 
in the backward mode the shortest path — on the cells known so far — to 
the first cell on the stack that is unexplored or has unexplored neighbors 
instead of walking back using every reserved cell, see the first improvement of 
DFS. From a practical point of view, this improvement is very reasonable, 
because the performance of the strategy increases in most environments. 
Unfortunately, the return path (i.e., the path walked in the backward mode) 
is no longer determined by a local configuration of cells. Instead, we need a 
global view, which complicates the analysis of this strategy. However, there 
are polygons that force this strategy to walk exactly the same return path as 
CellExplore without any optimization, see Figure [23l so this idea does not 

"step to the left" or "turn left" means that the robot turns 90° counterclockwise 
and moves one cell forward. Analogously with "step to the right" or "turn right". 

*If the robot turns left, we reserve three cells: right hand, straight forward, and forward- 
right to the robot's position. Note that we store the markers only in the robot's memory. 
This allows us to reserve cells that only touch the current cell, even if we are not able to 
determine whether these cells are free or blocked. 
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Figure 23: A polygon with C = 69, f = 52, iJ = 1, Wcw = 2, S = 124 = C + + 
3H + Wcw — 2. The return path in this polygon cannot be shortened. 

improve the worst case performance, and the upper bound for the number 
of steps is the same as in Theorem [181 

5.2 The Analysis of CellExplore 

We analyze CellExplore in three steps: First, 



local view and with one assumption concerning m 
the robot's initial position, see the figure on the 
right. This results in a bound that depends on the number of left turns 
that are needed to explore the polygon. Then we discard the assumption, 
and finally we consider some global arguments to replace the number of left 
turns with parameters of the polygon. 

To analyze our strategy CellExplore we use the following observations: 

• CellExplore introduces a dissection of all cells into cells that are ex- 
plored in the forward mode, denoted by T, and cells that are explored 
in the backward mode, B: C = T U B. 

• All cells that are explored in the forward mode can be uniquely clas- 
sified by the way the robot leaves them: either it makes a step to the 
left, a step forward or a step to the right. T = Ti,\J Tp U Tb.- 

• CellExplore defines a mapping : B — f J-", which assigns to every 
cell c^B one cell d E J^, so that c was reserved while d was visited. 

• There exists a subset T) ^B oi the cells that have an unexplored and 
unreserved neighbor and the strategy switches from backward mode 
to forward mode. We call these cells division cells. 



we analyze the single steps of the strategy with a 
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Figure 24: Decomposing a polygon. A denotes the start cell and the initial direc- 
tion. AC, AE and AS" denote the differences in the number of cells, edges, and 
steps, respectively. C denotes the balance. 
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Figure 25: Decomposing a polygon. The shaded part shows the reserved cells. 



We will analyze CellExplore by an induction over the cells in T . Starting 
with the given polygon, P, and the given start cell, s, we can define a 
sequence of polygons (Po,o := P) with start cells s^^i as follows: Pfc.j+i 
arises from P^.j by removing the start cell s^.i and all cells that are reserved 
in the first step in P^ j (i.e., every cell c with (/9(c) = Sfc^j). The start cell 
in the new polygon Pfc,j+i is the cell that the robot enters with its 
first step in Pfc^j, see Figure [25l The reserved cells in this and all following 
figures are shown shaded; A denotes the start cell. 

There is nothing to consider when the strategy enters the backward 
mode, because we remove all cells that are explored in the backward mode 
together with the forward cells. But what happens, if a division cell occurs; 
that is, the strategy switches from the backward mode to the forward mode? 



Lemma 13 If one of the cells reserved in the first step in P^^i is a division 
cell, then P^ i is split by removing s^^i and all cells that are reserved in 
this step (i.e., all cells c with ip(c) = Sk,i) into two or more not-connected 
components. 

Proof. Consider two components, Pi and P2, that are connected in P by 
some cells X d B. Let Cj be the first cell in X that is discovered on the 
return path, thus, P2 is entered via Cj. In our successive decomposition, 
ip{cj) is the start cell of a polygon P^^i- In Pk^i all cells explored before ^{cj) 
are already removed. If there would be another connection, C£, between Pi 
and P2 at this time, q would be discovered before Cj on the return path and 
P2 would have been entered via q in contradiction to our assumption that 
P2 is entered via Cj. Thus, Cj must be the last cell that connects Pi and P2. 

□ 

If one or more of the cells to be removed are division cells, the polygon 
Pfc^j is divided into subpolygons Pfc+i,0) Pfc+2,0) • • • that are analyzed sepa- 
rately, see Figure [26j See Figure [251 for a more comprehensive example for 
the successive decomposition of a polygon. 

Now, we are able give a first bound for the number of steps that CellExplore 
uses to explore a polygon. 
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P2 — Pk+2,0 





Pi — Pk+1,0 



Figure 26: Handling of division cells. 



Lemma 14 Let us assume that the cell behind and right hand to the robot's 
positioi^ is blocked. The number of steps, S, used to explore a polygon with 
C cells, E edges and H holes, is bounded by 



where L denotes the number of the robot's left turns. 

Proof. We observe the differences in the number of steps, cehs, edges and 
holes between Pk^i and Pk,i+i, and assume by induction that our upper 
bound for the length of the exploration tour holds for Pk^i+i and for the 
separated subpolygons Pk+j^. Therefore, we have to show that the limit is 
still not exceeded if we add the removed cells and merge the subpolygons. 
We want to show that the following inequation is satisfied in every step: 



Let G denote the "profit" made by CellExplore; that is, the difference 
between the actual number of steps and the upper bound. With G, the 
preceding inequation is equivalent to 



We have to consider three main cases: the division cells, the cells con- 
tained in J^L (left turns), and those contained in Tp^^R (forward steps and 
right turns). There is no need to consider right turns explicitly, because steps 
to the right can be handled as a sequence of forward steps, see Figure [271 The 
successive decomposition ends with one single cell {C = 1,E = A,H = 
for which S = ^ = C+\E + 11 + 21-2, holds (Induction base). 

®In other words, the ceU southeast to the robot's current position if the current direction 
is north. 



S< C+-E + H + 2L-3, 



S< C + -E + H + 2L-3. 
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Figure 27: Decomposing a step to the right into several forward steps. 



Division cells 

If one of the cells that are reserved in the first step in P^^i is a division 
cell, Pk^i is split into two polygons Pk+i,o {Pi for short) and Pk+2,0 
(P2 for short), see Lemma [T3j We assume by induction that our upper 
bound is achieved in both polygons: 

Ci + ^Ei + Hi + 2Li-Gi-S^ -3 = 0, ie {1, 2}. 

For the merge of Pi and P2 into one polygon P, we can state the 
following: 



s 


= Si 


+ S2 + AS 


E 


= El 


+ E2 + AE 


C 


= Ci 


+ C2 


H 


= Hi 


+ H2 


L 


= Li 


+ L2 


G 


= Gi 


+ G2 + Gs, 



where Gs denotes the profit made by merging the polygons. 
We want to show that our bound is achieved in P: 

G+^E + H + 2L-G-S-3 

= Gi + G2 + \{Ei + E2 + AE) + Hi + H2 + 2Li 

+ 2L2 — Gi — G2 — Gs — Si — S2 — AS — 3 
= Gi + ^Ei +H1 + 2L1-G1- Si-3 
=0 

+ C2 + 2^^^ + H2 + 2-L2 ~ G2 — S2 — 3 
=0 

+ ^AE -Gs-AS + 3 
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= 

^ Gs = ^AE-AS + 3 

Thus, for every polygon split we have to observe AE and AS. If 
Gs is positive, we gain some steps by merging the polygons, if Gs is 
negative, the merge incurs some costs. 

The different configurations for a polygon split can be assigned to 
several classes, depending on the number of common edges between Pi 
and P2 and the way, the robot returns from P2 to Pi — more precisely 
the distance between the step from Pi to P2 and the step from P2 to 
Pi, compare for example Figure ISOTSa) and (3b). Figure ISOT i) and (ii) 
show some instances of one class. Because the actual values for AE 
and AS depend on only these two parameters, we do not list every 
possible polygon split in Figure [30l but some instances of every class. 

The balances of the polygon splits are shown in table [TJ Two cases 
have a negative balance, so we need one more argument to show that 
these cases do not incur any costs. Observe that after splitting P^^i the 
polygon Pi starts with a cell from J^l in the cases (lb)-(5). Removing 
this block of 4 cells, we gain +2, see the first line of Figure [H2J This 
covers the costs for the polygon split in the cases (4a) and (5a). 

Forward steps 

We have to consider several cases of forward steps, see Figure [3TJ The 
table lists the differences in the number of steps (A5), cells (AC), 
edges (AE) and holes (AH) if we add the considered cells to Pk^i+i- 
The last column shows G = AC+\AE+AH-AS. After removing the 
observed block of cells, the remaining polygon must still be connected; 
otherwise, we would have to consider a polygon split first. Thus, there 
are some cases, in which AH must be greater than zero. 

It turns out that all cases have a positive balance, except those that 
violate the assumption in Lemma O that the cell behind and right 
hand to the robot's position is blocked, see the cases marked with (*) 
in Figure [3TJ Notice that the configurations shown in Figure [28] are 
left turns instead of forward steps! 

To show that we have analyzed all possible cell configurations for a step 
forward, we use the following observations. When the robot makes a 
step forward, we know the following: Both behind and left hand to 
the robot are walls (otherwise it would have turned left), in front of 
the robot is no wall (otherwise it could not make a step forward). 
Right hand to the robot may or may not be a wall. In the latter 
case, we have three edges of interest that may or may not be walls, 
yielding 2^ = 8 cases, which can be easily enumerated. Two special 
cases occur by taking into consideration that the robot may enter the 
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observed block of cells from the upper cell or from the right cell, see 
for example Figure ISTTSa) and (5b). 
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Figure 28: Configurations that arc steps to the left instead of forward steps. 

Steps to the left 

Possible cases of left turns are shown in the figures [5214351 As in the 
previous case, the last column shows the balance. Again, we observe 
that all cases with a negative balance of —3 are a violation of our as- 
sumption that the cell behind and right hand to the robot's position is 
blocked. The negative balances of —2 are compensated by the addend 
2L in our bound. 




(i) Block is entered from left (ii) Block is entered from upper cell 

AS- = 4 AS = 2 



Figure 29: Another class of left turns. 

When counting the number of steps, we assumed that the robot enters 
the block of cells from the same direction as it left the block (from 
the left as shown in the figures). The robot may enter the block from 
the upper cell as shown in Figure [291 but this would only increase the 
balance. 

The completeness of the cases can be shown with the same argument 
as in the previous case: We know that there is a wall behind the robot 
and left hand to the robot is no wall. Examining a block of four cells 
for a left turn, we have six edges that may or may not be walls, and we 
have three cells that may or may not be holes, yielding corresponding 
configurations of edges. □ 
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AS Gs 

(1) -2 2 

(2) -4 1 
(3a) -6 
(3b) -6 -2 2 
(4a) -8 -1 
(4b) -8 -2 1 
(4c) -8 -4 3 
(5a) -10 -2 
(5b) -10 -2 



Table 1: Balances of polygon splits 
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Figure 31: Cell configurations for forward steps. 
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Figure 32: Possible configurations for steps to tlie left (1). 
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Figure 33: Possible configurations for steps to tlie left (2). 
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Figure 35: Possible configurations for steps to the left (4). 
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Next, we want to discard the assumption we made in Lemma HM 

Lemma 15 The assumption that the cell behind and right hand to the 
robot's position is blocked can be violated only in the robot's initial position; 
that is, in Po,o o^^^ in any Pj,^i with A; + i > 0. 

Proof. 

Consider a robot located in some cell c with 
no wall behind and right hand to its position, and 
w.l.o.g. dir ='north'. If this is not the robot's initial 
position, but the position Sk,i of a polygon P^/i occur- 
ring in the successive decomposition of the polygon, 
the robot must have entered the cell c from the cell c' below c in the polygon 
Pk,i-i- If the cell c" right to c' is not a hole in Pfc,i_i, c" would be a reserved 
cell, and, thus, it would be removed together with c' in the step from Pk.i~i 
to Pk,i- Consequently, when the robot has reached c, there would be a hole 
behind and right hand to its current position. □ 




Lemma 16 The number of steps, S, used to explore a polygon with C cells, 
E edges and H holes, is bounded by 

S< C + + H + 2L-2, 
where L denotes the number of the robot's left turns. 

Proof. Lemma [15] shows that the assumption in Lemma [T^ can be violated 
only in the robot's initial position. On the other hand, we have seen in the 
proof of Lemma UM that all cases that violate the assumption incur the costs 
of just one additional step. □ 



(i) (ii) 

Figure 36: Corridors of odd width. 

Our bound still depends on the number of left turns the robot makes 
while exploring the polygon. To give a bound that does not depend on 
the strategy, we introduce another property of grid polygons, a measure 
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to distinguish rather flat polygons from winded polygons; let us call it the 
sinuosity of P. Our motivation for introducing this property is the following 
observation: The robot may walk n + 1 times through a corridor of width 
n, n odd, see Figure [36lfi). The costs for this extra walk are covered by the 
corridor walls; more precisely, for every double visit we charge two polygon 
edges and get the addend in the upper bound. If there is a left turn in 
the corridor, there are not enough boundary edges for balancing the extra 
walk. Figure [36111) shows a corridor of width 3 with a left turn. The steps 
shown with dashed lines cannot be assigned to edges, so we have to count the 
edges shown with dashed lines to balance the number of steps. We define 
two types of sinuosities, the clockwise and the counterclockwise sinuosity. 
Because CellExplore follows the left-hand rule, our strategy depends on the 
clockwise sinuosity. A similar strategy that follows the right-hand rule would 
depend on the counterclockwise sinuosity. 




(i) (ii) 
Figure 37; Contributions to Wcw by (i) the outer boundary, (ii) inner boundaries. 
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Figure 38: Reflex vertices pi and the corresponding squares Qi. Wcw = 9i + 93 = 
8, Wccw = 94 = 2. 
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Definition 17 Let the clockwise sinuosity, Wcw, and the counterclockwise 
sinuosity, Wccw, of a grid polygon P be defined as follows: We trace the 
boundary of P — the outer boundary clockwise, the boundaries of the holes 
inside P counterclockwise — , and consider every pair, pi and Pi+i, of con- 
secutive reflex vertices, see Figure [371 

We trace the angular bisector between the two edges incident to pi in- 
side P until it hits the boundary of P. The resulting line segment defines 
the diagonal of a square, Qi, see Figure ISsI^ Let qi be the width of Qi, 
analogously with qi+i. 

Because the robot needs some further steps only in odd corridors, we 
count only odd squares: 

/ _ f - 1, if Qi is odd 
^* \ 0, if Qi is even 

The need for additional edges may not only be caused by reflex vertices, but 
also by the start cell, see Figure ISW ii) . Thus, we consider the squares Qsc„ 
and Qsccw from the start cell in clockwise and counterclockwise direction, 
respectively. Let q'^^^ and q's^^^ be defined analogously to q[. Now, we define 
the clockwise sinuosity Wcw and the counterclockwise sinuosity VFccw as 

Wc^ : = q's^^^ +^q2i-i, and Wccw : = q's,^ +^q'2i- 

i>l i>l 

Figure [HU] shows two examples for the definition of Wcw Note that in (i) 
only one refiex vertex contributes to Wcw, and every edge we count here is 
needed. 




(i) (ii) 

Figure 39: Examples for the definition of Wcw- (i) A polygon with C = 193, -j = 
78, H = 3, Wcw = 6,S = 284 (the bound for S is exaetly achieved), (ii) the start 
cell contributes to Wcw, too (C = 46, f = 23, H = 2, Wcw = 2,5 = 74). 



We can construct Qi by "blowing up" a square around the cell in P that touches the 
boundary of P in pi until the corner of Qi opposite to pi hits the polygon's boundary. 
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With the definition of Wcw, we can give our final result: 

Theorem 18 Let P be a grid polygon with C cells, E edges, H holes, and 
clockwise sinuosity Wcw CellExplore explores P using no more than 

S< C + ^E + Wc^ + 3H -2 

steps. This bound is tight. 




(i) (li) 
Figure 40: Left turn followed by (i) a right turn and (ii) a reduction. 

Proof. We need some global arguments to charge the costs for a left turn 
to properties of P. So let us examine, which configurations may follow a left 
turn (after some forward steps): 

• A right turn follows the left turn, see Figure HUTi). We gain +2 steps 
per right turn, so the possible costs of —2 for this left turn are covered. 

• An obstacle follows the left turn. We can charge the obstacle with 
the costs for the left turn and get a factor of 3 for the number of 
obstacles. Every obstacle is charged at most once, because when the 
successive decomposition reaches the obstacle for the first time, the 
obstacle disappears; that is, the hole merges with the outer boundary. 

• A reduction follows the left turn, see Figure HOlfii). Later in this sec- 
tion, we show that a reduction covers the costs of a left turn. 

• Another left turn follows the observed left turn. In this case, there is 
no other property of P to be charged with this costs but the sinuosity 
Wcv!, this follows directly from the definition of Wcw 

In the case of a reduction following a left turn we observe the number, d, 
of forward steps between the left turn and the reduction, as well as the cell 
marked with 6, and — if d > 1 — the cell marked with a in Figure HHi). If b is 
blocked, b is either part of an obstacle inside the polygon or it is outside the 
polygon. In the first case, we charge the obstacle with the costs of the left 
turn as described earlier. In the second case we have a polygon split that 
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Figure 41: (i) A reduction follows a left turn, (ii) the left turn causes a polygon 
split, (iii) no forward steps between the left turn and the reduction {d ~ 0). 



leaves us with a left turn that incurs no costs, see Figure HTT ii) and the first 
line of Figure [32j The same holds for the cell marked with a if there is at 
least one forward step between the left turn and the reduction (i.e., d > 1). 
Therefore, we assume that a and b are free cells in the following. 

If the reduction follows immediately after the left turn {d = 0), see 
Figure I^TTiii). we have one of the left turns shown in Figure [33l Figure [35] or 
the lower half of Figure [Ml In any of these cases we have either a positive 
balance or we meet an obstacle {AH > 0) and charge the costs for the left 
turn to the obstacle as earlier. 




G = +2 G>-2 



(i) (ii) 



Figure 42: If a and b are free cells we gain 2. (i) d = 1, (ii) d> 1. 



If there is one forward step between the left turn and the reduction 
{d = 1), the robot enters the 2x2 block of cells of the left turn not from the 
same side as it left it, because a and b are polygon cells. In Figure [l^Ti) the 
robot leaves the block to the left but enters it from above. This situation 
is described in Figure [29] on page [M] and reduces the costs for the left turn 
by 2, so the balance is either zero or positive. If there is more than one 
forward step {d > 1), we have either the same situation as in the preceding 
case, or the reduction from a corridor of width > 3 to a corridor of width 
< 2 shifts to the left, see Figure [l2lfii). and eventually we reach a forward 
step as shown in Figure [STT Sb) that gains +2 and covers the costs for the 
left turn. 

Altogether, we are able to charge the costs for every left turn to other 
properties, which proves our bound. 

Figure [3U] and Figure [13 show nontrivial examples (i.e., H ^ and 
7^ 0) for polygons in which the bound is exactly achieved. □ 
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Figure 43: Polygon with C 
3H + T^cw - 2. 

6 Summary 
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= 34,f = 17, H = 1,W, 



= 2,S = M = C + + 



We considered the exploration of grid polygons. For simple polygons we 
have shown a lower bound of | and presented a strategy, SmartDFS, that 
explores simple polygons with C cells and E edges using no more than 
C + — 3 steps from cell to cell. Using this upper bound, we were able 
to show that SmartDFS is in fact |-competitive, leaving a gap of only g 
between the upper and the lower bound. 

On the other hand, the competitive complexity for the exploration of grid 
polygons with holes is 2. A simple DFS exploration already achieves this 
competitive factor, but, nevertheless, DFS is not the best possible strategy, 
because is it not necessary to visit each cell twice. Therefore, we developed 
the strategy CellExplore that takes advantage of wider areas in the polygon, 
and thus corrects the weakness in the DFS strategy. 

Interesting open questions are, how SmartDFS and CellExplore can be 
generalized to higher dimensions and other cell types than squares, e.g. 
triangular or hexagonal cells. 
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